Methods Inf Med 2010; 49(05): 467-472
DOI: 10.3414/ME09-02-0052
Special Topic – Original Articles
Schattauer GmbH

Investigation of an Automatic Sleep Stage Classification by Means of Multiscorer Hypnogram

V. C. Figueroa Helland
1   Interdisciplinary Center for Dynamics of Complex Systems, University of Potsdam, Potsdam, Germany
,
A. Gapelyuk
2   Department of Physics, Humboldt-Universität zu Berlin, Berlin, Germany
3   Charité – Universitätsmedizin Berlin, Campus Berlin-Buch, Experimental and Clinical Research Center, Berlin, Germany
,
A. Suhrbier
3   Charité – Universitätsmedizin Berlin, Campus Berlin-Buch, Experimental and Clinical Research Center, Berlin, Germany
,
M. Riedl
2   Department of Physics, Humboldt-Universität zu Berlin, Berlin, Germany
,
T. Penzel
4   Department of Sleep Medicine, Charité – Universitätsmedizin Berlin, Campus Mitte, Berlin, Germany
,
J. Kurths
2   Department of Physics, Humboldt-Universität zu Berlin, Berlin, Germany
5   Potsdam Institute for Climate Impact Research, Potsdam, Germany
,
N. Wessel
2   Department of Physics, Humboldt-Universität zu Berlin, Berlin, Germany
3   Charité – Universitätsmedizin Berlin, Campus Berlin-Buch, Experimental and Clinical Research Center, Berlin, Germany
› Author Affiliations
Further Information

Publication History

received: 20 November 2009

accepted: 23 February 2010

Publication Date:
17 January 2018 (online)

Preview

Summary

Objectives: Scoring sleep visually based on polysomnography is an important but time-consuming element of sleep medicine. Whereas computer software assists human experts in the assignment of sleep stages to polysomnogram epochs, their performance is usually insufficient. This study evaluates the possibility to fully automatize sleep staging considering the reliability of the sleep stages available from human expert sleep scorers.

Methods: We obtain features from EEG, ECG and respiratory signals of polysomnograms from ten healthy subjects. Using the sleep stages provided by three human experts, we evaluate the performance of linear discriminant analysis on the entire polysomnogram and only on epochs where the three experts agree in their sleep stage scoring.

Results: We show that in polysomnogram intervals, to which all three scorers assign the same sleep stage, our algorithm achieves 90% accuracy. This high rate of agreement with the human experts is accomplished with only a small set of three frequency features from the EEG. We increase the performance to 93% by including ECG and respiration features. In contrast, on intervals of ambiguous sleep stage, the sleep stage classification obtained from our algorithm, agrees with the human consensus scorer in approximately 61%.

Conclusions: These findings suggest that machine classification is highly consistent with human sleep staging and that error in the algorithm’s assignments is rather a problem of lack of well-defined criteria for human experts to judge certain polysomnogram epochs than an insufficiency of computational procedures.